Covariant Symmetry Classifications for Observables of Cosmological 

Birefringence 



John P. Ralston 

Department of Physics and Astronomy, Kansas University, Lawrence, KS-660^5, USA 

ralston@kuphsx.phsx.ukans.edu 

and 

Pankaj Jain 

' Physics Department, I.I.T. Kanpur, India 208016 

^\ , pkjain@iitk.ac.in 



OO 



I. INTRODUCTION 



CIh 1 Polarizations of electromagnetic waves from distant galaxies are known to be correlated with the 

source orientations. These quantities have been used to search for signals of cosmological birefrin- 
gence. We review and classify transformation properties of the polarization and source orientation 
observables. The classifications give a firm foundation to certain practices which have sprung up 
£Nj ' informally in the literature. Transformations under parity play a central role, showing that parity 

violation in emission or in the subsequent propagation is an observable phenomenon. We also discuss 
statistical measures, correlations and distributions which transform properly and which can be used 
for systematic data analysis. 
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Galaxies at cosmological distances are known to emit polarized electromagnetic radiation according to certain 
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statistical regularities. The processes range from linear polarization from radio galaxies due to synchrotron radiation, 
to centro-symmetric polarizations sometimes seen in optical measurements of QSO's. The observables can include 
O the percentage of polarization, a linear polarization angle, a galaxy symmetry axis, and the redshift of the galaxy, as 
.1 | well as the angular position coordinates of the galaxy on the dome of the sky. One typical correlation is an observed 
alignment of the bisector of a radio galaxy's long axis and linearly polarized radiation in the "FR1" class of objects. 
This has been interpreted as probing the source's magnetic field in conventional models. While this effect is in the 
C$ context of conventional physics, other polarization correlations can test extremely small effects that might indicate 
new physics-sensitive to "cosmological birefringence" - that might occur in the subsequent propagation. This has led 
to several interesting studies, from seminal early work on propagation in parity- violating electro- weak media by Leo 
' Stodolsky jjj, to the discussion of "optical activity of the universe" by Gabriel Karl and collaborators to recent 
studies of possible cosmological anisotropy in propagation (^^] . 

Galaxy and polarization axes are labeled by angles, typically measured in "degrees East of North" in a spherical 
-polar system of angular coordinates. The galaxy major axis projected into the tangent plane is measured with an 
angle %p. A linearly polarized electric field orientation in the same coordinates is assigned polarization angle x ■ The 
angles, however, do not represent vectors in the rigorous sense, because a rotation of 7r radians returns an axis to 
its original orientation. These observables are described by elements of the real projective group, which is not very 
familiar in physics. In this paper we will investigate and then automate proper transformation properties of such 
observables, and list quantities which are invariant under different symmetries. This provides a firm foundation for 
certain traditions and informal uses of variables which have sprung up in the literature. Rather interestingly, the 
split-up of the three dimensional orthogonal group into disconnected subgroups of different parity also occurs for 
observables connected with polarization. One then has quantities which are even or odd under parity, which extends 
the list of symmetries one can test using polarization data. 

The physical motivation for this is as follows. Consider a linear polarization correlated with an emission axis. From 
"symmetry", statistics would naively be expected to show equal offsets of polarization axis relative to galaxy axis 
for either a positive or a negative sense of rotation. The particular "symmetry" assumed here is parity symmetry. 
While the emergence of parity as an observable may be surprising, it should be intuitively clear that observation 
of the converse would allow one to extract a definition of right-versus -left handed senses of rotation from a data 
set, breaking parity symmetry. An even more obvious violation of parity symmetry would be preference for one 
kind of circular polarization over another. While one may not expect an object the size of a galaxy to break parity 
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symmetry in its emission characteristics, the possibility constitutes a valid observable, subject to classification and 
study. Another more interesting possibility, moreover, is to assume that objects do respect parity symmetry in a 
statistical sense upon emission, while the intervening medium can be tested for parity violation to an extremely fine 
degree. Many models predict just such effects H,||. We will proceed by setting up the transformation properties, 
listing covariant and invariant quantities, and illustrating the remarks by some statistical quantities which embody 
the ideas for data analysis. 

II. SETUP 

Let us discuss transformation properties. We are not concerned with Lorentz boosts, nor the usual Lorentz covariant 
characterization of radiation in terms of the energy momentum tensor, etc. Our focus is the "little group" of angular 
variables in the rest frame of the observer, and the peculiar fact that the angles \ and "0 actually observed give the 
orientation of "sticks" . A rotation of just 7r radians about the axis of propagation makes no observable effect. Due 
to this unusual transformation property, both the electric field and the galaxy axis are conveniently represented by 
certain tensors. 

Let p be the radial unit vector pointing toward a particular source. Construct a local Cartesian frame with its 
z-axis oriented along p (Fig 1). The 2-dimensional subspace orthogonal to p is a plane tangent to the unit sphere 
of the sky. Any consistent orientation of the local x and y axes spanning the tangent plane can be used; one can 
always take these to be in the directions of increasing azimuthal and polar angles (4> and 9) relative to a global North 
pole (hence the astronomer's "East of North"). The electric field (E x ,E y ,0) is transverse to p and conventionally 
represented by a 2 component complex doublet, a convention hearkening back to optics 0. In the circular polarization 
basis, \E > = \\f2(E x + iE y ,E x — iE y ). The doublet notation is a hybrid and the transformation properties need 
examination. 

First, in averaging over many cycles of the wave, the observer does not actually measure the electric field. Instead 
what can be measured is an Hermitian density matrix (?]] EiE* = J{j — (J un poiz + Jpoiz)ij- In our local coordinates, 
the non-zero entries of J are a 2 x 2 block in the upper left hand corner of a 3 x 3 matrix. If the coordinate system 
is rotated, this tensor will transform in the usual way, because Ei and Ej transform like 3-vectors. 

The unpolarized part J un poiz is proportional to "1" with a proportionality constant fixed by the total power. The 
polarization density matrix J po i z is defined |t]] to be the rest of the matrix, "as if it were a pure state. Thus 
det(J po i z ) = 0. This matrix is conventionally expanded in Pauli matrices: 

Jpolz — ~^Pmax{^ ~l~ X ' — ~^Pmax{^ ~t~ A ) 

where P ma x is the degree of polarization, x is a unit-3 vector, and a are the Pauli matrices. Equivalent are the 
"Poincare" coordinates d p and x (for describing elliptical and circular polarization Q]): 

i _ ( cos § p sin § p e 2lx \ 
* ~~ \sa\d p e~ 2lx -costfp ) 

We have introduced the common "slash" notation for contraction of a vector with the Pauli matrix. We can also use 
^ for the 3x3 matrix when no confusion between the two is possible. The polarization parameters have now been 
organized in a form convenient for transformations. 

Consider the important case of a rotation of tangent plane coordinates about the propagation axis, that is the local 
z-axis, \E >— - > U\E >, with U 3 x 3 and in 5*0(3). U takes the block form U = (u^, 1) where is 2 x 2 and unitary, in 
fact an element of SO(2). Of course y[ — * u^u\- This is the same rule as the rotations in the spin 1/2 representation, 
but in this case operating on vector components, not spinors. It follows that the parameters x transform with the 
angle of rotation doubled. As a check, a short calculation shows that a linear polarization oriented at angle x relative 
to the local x-axis gives x — ( c °s2x, sin2x,0) in the local basis. The factor of "2" is just right to account for the 
periodicity of the observable plane of the electric field under rotations by ir (as opposed to 2tt) . This supports the 
informal use of x m literature ranging from biology to astronomy flQl , where the "2" is inserted intuitively and 
to make things come out right while making it clear that x is n °t a true vector. 

The result is elucidated by the following general argument. Recall the familiar decomposition of angular momentum 
(j) states for the product \E > x < E\ = 1x1 = 2 + 1 + 0. The states are labeled by (m, j) representing the 
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eigenvalues of j z and j 2 , respectively. However, since E is transverse, all tensor products made from m — are absent. 
From the addition rule for m, these absent representations are the m = 0, m = ±1 states of j = 2, and the m = ±1 
states of j = 1. This eliminates 5 of 9 possible combinations, leaving 4 total which are (2, 2), (— 2, 2), (0, 1), (0, 0). 
The 4 possibilities are the 4 Stokes parameters, with the total power E ■ E* being the singlet (0, 0). The object x, a 
deceptively vector-like position on the Poincare sphere, is made from the peculiar combination (2,2), (—2,2), (0, 1). 
Making a short calculation in the helicity basis, iP max cos(i? p ) = l/2Tr[a z J] = i(E + E+ - E_E*_) = (E x E*) z is the 
(0, 1) component, showing that the ellipticity transforms like a 3- vector attached to the photon's direction of motion. 

A similar treatment is needed for a galaxy axis. An observed galaxy axis is the projection of the physical major 
axis (a signless eigenvector of a 3 x 3 tensor of intensity distribution, say) onto the tangent plane. We can again 
define in a helicity basis a 2 component \ip >= (cos-0 + i simp, cos-0 — i sinf/^/v^- Again \ip > is not a good variable, 
because the observable is actually projective, and we must identify \ip >= —\ip >. The associated 2x2 matrix, 
\ip >< ip\ = \ + is part of a tensor embedded in a 3 x 3 matrix. In our coordinates it can be expanded in Pauli 
matrices 

/ 1 2«/>\ I I 

\*>«>\ = 1 fi{h.c 1 ) = 2 1+ 2^ 

The matrix if) transforms like u^u^ and defines a 3 component thing, %p = (cos 2-0, sin 2tp, 0) . Note again the doubling 
of angles in 0, representing the information that \ip > and — \ip > are now identified, just as informally practiced in 
the literature. (It is interesting that if one could deduce information on the "pitch" of a galaxy, then ip could point 
out of the plane: such an axis might be "elliptically polarized", otherwise tp is in the local tangent plane.) 

We also have the position on the sky p, which is a true vector and not to transform with doubled angles. 

Finally we should classify the objects under parity: by definition we have that p — ► —p. tp and x are only a little 
more work: since ipj = l/2Tr[jpaj], with $ invariant, then and x are invariant under parity (although transforming 
in a complicated way under the full 0(3)). 

Now one can transform away from the special coordinate system, using general 3x3 elements of SO(3), with the 
three objects ^, ]f>, and p all transforming properly. 

III. COVARIANTS AND INVARIANTS 

We now turn to invariants one can make from the matrices j( and p. A "local" quantity will be one made from 
a single source, or (if possible) different sources at the same location on the dome of the sky; a "non-local" quantity 
anything made from sources at different locations. 

Local quantities: Since for any galaxy i a coordinate system exists where all the matrix elements are in the upper 
left, then we have the covariant identity for such 3x3 matrices 

/I? = A ■ B 1 + ifl; C = Ax B (1) 

The "1" of course means the 2x2 unit matrix on the upper left, covariantly written as Sij — Piftj. As a consequence 
of this identity an obvious invariant is reduced to a simpler form: 

*i - \ttW] = \ttW) = x ■ j>. 

It follows that Si = cos(2(x — ipj), which is clearly invariant under the local rotation x — > X + <^V' — + K al so 
follows that si is even under parity. 

Another useful quantity is the anti-symmetric 3x3 commutator [X;,j/>]. This is dual to a pseudo- vector: 

A t = l^TriaM (2) 

where is the completely anti-symmetric matrix with j, k elements e[ k . In our local coordinate frame, Ai points 
in the direction of p and is proportional to sin(2(x — ip)). The sign of proportionality depends on the right-handed 
convention for angles. This remains true however the coordinate system is rotated. The epsilon-tensor is even under 
parity, making it clear that A is even and therefore a pseudo- vector by construction. Since any quantity which is odd 
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in (x — iff) reverses when a positive "sense" of rotation is reversed to a negative one, such quantities are parity-odd 
on general grounds. 

This is not the only odd-parity observable. The helicity h of the wave is defined in a Lorentz-covariant manner as 
the projection of its spin along its direction of propagation. This is a pseudo-scalar: it must therefore be equal (up 
to a constant) to the product x ' P- The other invariants which can be made by contracting p and the matrices are 
trivially zero. Consulting the identity (1) above, there are 4 real- valued quantities in the products of the 2 matrices, 
which have now been classified into one scalar and one pseudo- vector, exhausting the possibilities for local bilinears 
of the two tensors. One can go further, and add another unit vector A to the problem. Such a vector is needed to 
quantify asymmetries of angular distribution. Then one can make a scalar S2 and a pseudoscalar p\\ 

S2 = p ■ A = cos(#); pi = A ■ A = sin(2(x — ip)) cos(0) 

where 9 is the angle between A and the position of the source on the dome of the sky. The pseudovector combination 
px A can also be considered, which makes the usual unit vector 8 sitting in the tangent plane, transforming like (±1, 1). 
By the j z addition rule for angular momentum, this cannot be combined with the (2, 2), (—2, 2), (0, 1) representations 
available from ^ and ]p to make new invariants. Using p, which transforms like (0, 1), we can take the (0, 1) part of ^ 
and make a pseudo-scalar, but this is the helicity h already discussed. 

Continuing in this way, products of higher order can always be reduced to sums of smaller dimensional repre- 
sentations, much like the usual decomposition of angular momentum. Some care is needed, however, because the 
polarization variables are made from incomplete representations because the fields are transverse. One generally then 
has fewer invariants than straightforward counting using rotation group methods might indicate. 

Non-Local Invariants: Combining different sources leads to some interesting quantities at low order. There is 
for example the familiar angle 8ij between galaxy i and galaxy j on the sky, given in terms of the true scalar 
cos(%) = pi -p-j. 

More interesting are symmetric and antisymmetric combinations of ^ and ^ • , or ^ ■ . It is straightforward to treat 
these by standard methods: thus the symmetric combinations 

Trlifijl; TrU^A; Trfe^]; 

are scalars; while the antisymmetric combinations or commutators 

UvijYi livtfjY, [tfvfj]; 

are dual to pseudo-vectors. 

The quantities above are involved in questions of angular coherence: for example, if one wants to smooth the 
angular distribution of sources at different locations, while it would be a bit sloppy to add the parameters such as x, 
in practice we are aware only of averaging polarization parameters over single sources of small angular dimensions, in 
a quasi-local way, so that this should not cause a problem. However if one were making a more ambitious study -for 
example examining the degree of coherent polarization over larger regions of the sky-then more formal care would 
be needed. One can, for example, create correlation functions of the matrices in a consistent basis and evaluated at 
different angular positions, which are then reduced to scalars by taking traces. Another interesting quantity is the 
hihj, or helicity-helicity correlation function, and obvious generalizations removing the means. This quantity does not 
need a galaxy axis for its evaluation, and might probe parity-violating effects of the medium evaluated as a function 
of angular scales. This may be useful for optical polarizations , for example, which often are not associated with axial 
structure. An application of interest is the cosmic micro-wave background, for which polarization measurements can 
be expected in the future. 

Distributions 

The classifications are useful in constructing statistical distributions or correlations which can be used to quantify 
observations. We will use a bracket ( <> ) symbol to denote an expectation value in a normalized distribution. 

Consider the problem of quantifying the correlations between x an d ip mentioned in the Introduction. From our 
results, one would naturally assume the distribution to be a function of the rotationally invariant quantity f3 = ip. 
Given that x an d ip are defined up to multiples of tt, the difference ranges in the most general case over 2ir, and not 
7r, as sometimes assumed pT[ in the literature. This is seen very simply by making sketches of some trial distributions. 
Analytically one can expand the distribution f(x, ip) in Fourier series for integer n,m: 

f(x> VP = ^ fnm exp(z2nx + i2mip) = f nm exp [i(n - m)(x - ip) + i(n + m)(x + ip)} 
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Reality of / must also be imposed; it prescribes the negative integer values of f nm , but does not restrict whether 
n ± to is even or odd. The coefficients f nm for odd values of n — to make [3 — \ — ip periodic on the interval of 27r, 
as claimed. Restricting non-zero coefficients to even values of n — to can be motivated by extra assumptions. One 
sufficient condition is that the distribution obeys overall rotational symmetry, giving f nm going like S(n + to). Such 
an assumption might seem very general but in fact it is not. It is an interesting fact of optics that even a perfectly 
transmitting (unitary) medium can treat different polarizations dissimilarly, leading to a non-trivial distribution of 
X + ip as a signal. The same mechanism can create an anisotropic distribution of linear polarizations along a pencil 
through a medium from an random uncorrelated distributed set of linear polarization emitters. To simplify the 
analysis we will assume here, however, that the distribution of (3 is periodic on the interval of ir. 

To make invariant distributions from invariants, this leaves us with the quantities constructed earlier, namely 
si = cos(2/3) and p\ = sin(2/3). A simple, and indeed well-known distribution that follows is the von Mises form 
HPHI^I : fvM{P) — const. exp(/ccos(2/3)) = const, exp(fcsi). This distribution has often been used in likelihood tests, 
but unfortunately without discussion of parity symmetry. By construction, the vM distribution embodies a physical 
assumption that the twist of one angle relative to another has no preferred parity, which may be unsuitable in some 
cases. The "shifted" von Mises distribution is similar: /shifted- v m(/3) = const. exp(fc cos(2(/3 — a)). The parameter a 
shifts the origin of (3 and might seem to be free. However if parity symmetry is assumed, then a is quite restricted. The 
distribution is a function of cos(2/3 — 2a) — cos(2/3) cos(2a)+sin(2/3) sin(2a). Parity symmetry requires f{(3) = /(— j3), 
yielding sin(2a) = 0, or a = 0, ir/2. Parity symmetry, then, is sufficient to predict that marginal distributions of linear 
polarizations tend to be oriented either along galaxy major axes, or perpendicular. This has been a misunderstood 
point, because of assertions that only the single choice of angular origin relative to the perpendicular is sensible. 
For any statistic which is covariant under a shift of origin, however, the matter is irrelevant, making no difference. 
Transforming (3 — > (3 — tt/2 is equivalent to transforming k — > — k in the vM or shifted vM distributions, showing that 
either choice is equally well described automatically. Similarly, sin(2/3) is odd in both (3 and (3 — 7r/2, a pseudoscalar 
no matter how angles are measured. Another example of the issue is underscored by the paper of Loredo et al. (ilf | 
using the shifted vM distribution in a detailed likelihood analysis. The paper responded to an odd-parity statistic 
used in Ref. but exclusively used an even parity shifted vM model in its analysis. In replacing the original statistic 
by one with opposite transformation properties, conclusions were drawn on a false basis. This illustrates the problem 
that can occur when using functions from a different symmetry class than the idea being tested. 

There is in fact a long history of mix-ups from lumping together observables of different parity, which has led to 
interesting contention in the literature. Birch |Q in 1982 empirically observed a pseudovector correlation in radio 
polarizations, odd in the difference (\ — V0- Kendall and Young (KY) made a model distribution to explore this 
p5[ , choosing vM for the null distribution of (3. This is an implicit assumption of parity conservation in the null, 
which is perfectly physical (but which should be stated explicitly) . KY then set up a conditional correlation function 
Cky(P,P) = exp(u\ ■ psm(2(3)) which involved odd-parity to test Birch's pseudovector result. Unfortunately Kendall 
and Young did not explain the reason for their ansatz, which in retrospect was entirely appropriate. The overall KY 
distribution finally can be written in a nicely compact form 

/ky(x, ^iP) = exp(kcos(2(3) + u\ ■ psin(2/3)) = exp(fcsi + upi) 

where si and p\ are our scalar and pseudoscalars from the first section, making it clear that a is a parity-violation 
anisotropic correlation parameter. Likelihood analysis, for example, can be used in a perfectly objective way to see 
whether any parity-violating effects might be present or not in a data set, and entirely separate from the need to 
quantify the marginal distribution of (3. It is interesting that Kendall and Young's data analysis jl5| along such lines 
then indicated a high statistical significance for the parity -violating effect Birch had observed; more recent work 
in this regard can also be cited 0]. However, other independent studies which used invariant correlations lumping 
together different parity seemed to contradict the result, a topic to which we now turn. 

Correlations 

One of the best-known statistical correlation tests comes from an influential paper by Jupp and Mardia fl^ j . Their 
prescription for correlation between 2 angular quantities involves mapping them into "vectors" v, w, that is covariantly 
transforming elements, of which \ and tp are examples we have already seen. To test for correlations, JM use canonical 
p x q correlation matrices 

=< (vi- < Vi >)(wj- < Wj >) > 

which behave under the separate transformations of v, w as the indices would indicate. Then the JM correlation test 
is to calculate 

np 2 v w = n Tr[E vw (J: ww )- 1 T, wv (E m )- 1 }, 
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with matrix products indicated, and where n is the number of data points in the sample. The step of dividing by 
auto-correlation matrices is used to make the quantity scale-invariant. One easily finds that np\ w = if v and w are 
independent, and the distribution of fluctuations in an uncorrelated null distribution has also be obtained ]l6| ]. Thus 
np 2 v w has served for many years as a useful simple test for independence. 

Unfortunately Jupp and Mardia, in discussing their correlation test, did not discuss parity and other discrete 
symmetries. In the problem at hand, to test whether the difference (3 is correlated with sky position, one might 
consider the JM correlation of a "natural vector" v = (cos(2/3), sin(2/3)) and w = p. This particular combination v 
was used to test Birch's correlation by Bietenholz and Kronberg Jl7| . Recall, however, the even-odd rule of parity for 
those quantities even-or odd in (3. It is clear that the above "natural" vectorlike combination v needlessly mixes two 
quantities which are of opposite parity. The two pieces are also separately invariant under rotations, and should not 
be combined into a vector. Ironically, history shows that the mixed parity combination was used while at the same 
time citing that Birch had a correlation of pseudovector character p7[ ; without recognizing the parity properties of 
Kendall and Young's procedure, both their and Birch's results were then rejected. From our analysis it is sufficient 
to use np 2 , 1 w and np 2 2 w , which allows separate tests of scalar or pseudoscalar kind. One can, of course, also work 
directly with scalar and pseudoscalar measures such as < s\ > and < p\ > to explore certain features of data, and 
create any number of statistics, once the proper transformation properties have been respected. 



IV. SUMMARY 



We have classified several combinations of the polarization and angular correlation observables under parity and 
angular momentum. The features of local observables, and their simple appearance in classic distributions and 
correlation analysis, should be helpful to those interested in the area. While not exploring very far into the non-local 
quantities, they appear to offer many possibilities for interesting studies. The possibility of exploring parity violation 
when there are no obvious emission axis variables, as in the case of upcoming polarized cosmic -microwave background 
observations from ground-based and satellite facilities, seems intriguing and bears further investigation. 

ACKNOWLEDGMENTS: This work was supported in part under DOE grant number 85ER40214, by the University 
of Kansas General Research Fund, and the Kansas Institute for Theoretical and Computational Science/ K*STAR 
program. 
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FIG. 1. Schematic diagram of observables. The variables \ an d 4> are orientation angles of the plane of electric field linear 
polarization, and of a source symmetry axis, respectively. A unit vector pointing towards the source on the dome of the sky 
(big sphere) is denoted by p. The polarization density matrix assigns a standard quantity s on the Poincare sphere (small 
sphere). The projection of s in the tangent plane transforms like spin-2 under rotations about the local p axis; the component 
of s along p is a measure of ellipticity, and transforms under a spin-1 representation. 
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